RESULT 1 / \ 

TMS5_HUMAN ) 

ID TMS5_HUMAN \ STANDARD; PRT; 4 57 AA. 

AC Q9H3S3; ^ 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2,001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transmembrane protease, serine 5 (EC 3.4.21.-) (Spinesin) . 

GN TMPRSS5 . , 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX PubMed=11741986; 

RA Yamaguchi N. , Okui A., Yamada T. , Nakazato H., Mitsui S.; 

RT "Spinesin/TMPRSS5 , a novel transmembrane serine protease, cloned from 

RT human spinal cord."; 

RL J. Biol. Chem. 277:6806-6812(2002). 

CC SUBCELLULAR LOCATION: Type II membrane protein (Potential). 

CC -!- TISSUE SPECIFICITY: Brain- specif ic . Predominantly expressed in 

CC neurons, in their axons, and at the synapses of motoneurons in the 

CC spinal cord. 

CC -!- SIMILARITY: Belongs to peptidase family SI. 

CC -!- SIMILARITY: Contains 1 SRCR domain. 

CC 

CC This SWISS -PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB028140; BAB20375.1; 

DR HSSP; P00763; IDPO . 

DR Genew; HGNC: 14 908; TMPRSS5 . 

DR MIM; 606751; 

DR MEROPS; SOI. 313; -. 

DR InterPro; IPR009003; Cys_Ser_trypsin. 

DR InterPro; IPR001254; Peptidase_Sl . 

DR InterPro; IPR001314; Peptidase_SlA. 

DR InterPro; IPR001190; Srcr_receptor . 

DR Pfam; ?F0008 9; trypsin; 1. 

DR PRINTS; PRO 072 2; CHYMOTRYPSIN . 

DR SMART; SM00020; Tryp_SPc; 1. 

DR PROSITE; PS50240; TRYPSIN_DOM; 1. 

DR PROSITE; PS00134; TRYPSIN_HIS; 1. 

DR PROSITE; PS00135; TRYPSIN_SER; 1. 

DR PROSITE; PS0042 0; SRCR_1 ; FALSE_NEG. 

DR PROSITE; PS50287; SRCR_2 ; FALSE_NEG. 

KW Hydrolase; Serine protease; Transmembrane; Signal -anchor ; 

KW Glycoprotein. 
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457 AA; 


49574 


MW; 64406AB4985A2651 CRC64; 



Query Match 100.0%; Score 2481; DB 1; Length 457; 

Best Local Similarity 100.0%; Pred. No. 5.6e-194; 

Matches 457; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 
Db 



1 MSLMLDDQPPMEAQYAEEGPGPGIFRAEPGDQQHPISQAVCWRSMRRGCAVLGALGLLAG 60 

lllllllllllllllllllllllllllllllillllllllllilllllllllllllllll 

1 MSLMLDDQPPMEAQYAEEGPGPGIFRAEPGDQQHPISQAVCWRSMRRGCAVLGALGLLAG 60 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



61 AGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEASAEEALLPALPKTVSFRINSEDFL 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

61 AGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEASAEEALLPALPKTVSFRINSEDFL 12 0 
121 LE AQ VRDQPRWLLVCHEGWS PALGLQ I CW S LGHLRLTHHKGVNLTD I KLNS SQE FAQL S P 180 

IIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

121 LEAQ VRDQPRWLLVCHEGWS PALGLQ I CWS LGHLRLTHHKGVNLTD I KLNS SQE FAQL S P 180 
181 RLGGFLEEAWQPRNNCTSGQWSLRCSECGARPLASRIVGGQSVAPGRWPWQASVALGFR 24 0 

IIIIMIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIII 

181 RLGGFLEEAWQPRNNCTSGQWSLRCSECGARPLASRIVGGQSVAPGRWPWQASVALGFR 24 0 
241 HTCGGSVLAPRWWTAAHCMHSFRLARLSSWRVHAGLVSHSAVRPHQGALVERIIPHPLY 300 

IIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

241 HTCGGSVLAPRWWTAAHCMHSFRLARLSSWRVHAGLVSHSAVRPHQGALVERIIPHPLY 300 



Qy 
Db 

Qy 

Db 

Qy 

Db 



3 01 SAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQHFPKGSRCWVSGWGHTHPSHTYSSDM 360 

IIIMIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

3 01 SAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQHFPKGSRCWVSGWGHTHPSHTYSSDM 3 60 

3 61 LQDTWPLFSTQLCNSSCVYSGALTPRMLCAGYLDGRADACQGDSGGPLVCPDGDTWRLV 42 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII 
3 61 LQDTWPLFSTQLCNSSCVYSGALTPRMLCAGYLDGRADACQGDSGGPLVCPDGDTWRLV 42 0 

421 GWSWGRACAEPNHPGVYAKVAEFLDWIHDTAQDSLL 457 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIII 
421 GWSWGRACAEPNHPGVYAKVAEFLDWIHDTAQDSLL 45 7 



RESULT 1 
S00845 

hepsin (EC 3.4.21.-) - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1988 #sequence_revision 31-Dec-1988 #text_change 18-Jun-1999 
C; Accession: SO 084 5 

R;Leytus, S.P.; Loeb, K.R. ; Hagen, F.S.; Kurachi, K. ; Davie^ E.W. 
Biochemistry 21 , 1067-1074, 1988 

A;Title: A novel trypsin-like serine protease (hepsin) with a putative 

transmembrane domain expressed by human liver and hepatoma cells. 

A; Reference number: S00845; MUID : 88209431; PMID : 2 835076 

A; Accession: SO 084 5 

A; Molecule type: mRNA 

A; Residues: 1-417 <LEY> 

A; Cross-references : EMBL :X07732 / NID:g32063; PIDN : CAA30558 . 1 ; PID:g32064 
C; Genetics: 

A; Gene: GDBiHPN; TMPRSSl; hepsin 

A; Cross-references : GDB:135685; OMIM : 14244 0 

A;Map position: 19qll-19ql3 . 2 

C; Superfamily : hepsin; trypsin homology 

C;Keywords: hydrolase; liver; serine proteinase; transmembrane protein 
F; 23-45/Domain: transmembrane #status predicted <T]yiN> 
F; 163-400/Domain: trypsin homology <TRY> 

F; 188-204, 291-359, 322-338, 349-381/Disulfide bonds: #status predicted 
F;203,257, 353/Active site: His, Asp, Ser #status predicted 

Query Match 24.3%; Score 602.5; DB 1; Length 417; 

Best Local Similarity 33.3%; Pred. No. 2.9e-41; 

Matches 13 9; Conservative 61; Mismatches 155; Indels 63; Gaps 11; 

Qy 53 GALGLLAGAGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEASAEEALLPALPKTVSF 112 

M II I II :| I : n= h :| I II 
Db 24 GTLLLLTAIGAASWAIVAVLLRSDQEPLYPV QVS SADARLMVFDKT 69 

Qy 113 RINSEDFLLEAQVRDQPRWLLVCHEGWSPALGLQICWSLGHLRLTHHKGVNLTDIKLNSS 172 

: I hi : : | :| || | ::: | : 

Db 70 EGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGT 114 

Qy 173 QEFAQLSPRLGGFLEEAWQPRN NCTSGQWSLRCSECGARPL-ASRIVGG 221 

I ::| I :| h I HI I I IIIM 

Db 115 SGFF CVDEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGG 166 

Qy 222 QSVAPGRWPWQASVALGFRHTCGGSVLAPRWWTAAHCMHSFRLARLSSWRVHAGLVSHS 2 81 

: : MINI h I lllhh Ihlllll I II III II h : 

Db 167 RDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPE-RNRVLSRWRVFAGAVAQA 225 

Qy 282 AVRPHQGAL-VERIIPHPLY SAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQ 334 

: II I h :: I I ==: = hlh 1=1 - = Mill I 

Db 22 6 S--PHGLQLGVQAWYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQ 2 83 

Qy 33 5 HFPKGSRCWVSGWGHTHPSHTYSSDMLQDTWPLFSTQLCNSSCVYSGALTPRMLCAGYL 3 94 

I I hllhl : : :|h Ih I Ul : I = hi MM 
Db 284 ALVDGKICTVTGWGNTQ-YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYP 342 

Qy 3 95 DGRADACQGDSGGPLVCPDG DTWRLVGWSWGRACAEPNHPGVYAKVAEFLDWI 44 8 

:| MIIIIIIM M I III hIMI M MM IhM Ml 

Db 34 3 EGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWI 4 00 



RESULT 5 

US-10-607-565-108 

Sequence 108, Application US/10607565 
Publication No. US20040048294A1 
GENERAL INFORMATION: / 
APPLICANT: Rosen etv^l . 

TITLE OF INVENTION: 31 Human Secreted Proteins 
FILE REFERENCE: PZ033P1 

CURRENT APPLICATION NUMBER: US/lO/607 , 565 
CURRENT FILING DATE: 2003-06-27 
PRIOR APPLICATION NUMBER: US/0 9/531,119 
PRIOR FILING DATE: 2000-03-20 

PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/101,546 
PRIOR FILING DATE: EARLIER FILING DATE: 1998-09-23 

PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/102,895 
PRIOR FILING DATE: EARLIER FILING DATE: 1998-10-02 
NUMBER OF SEQ ID NOS : 14 0 
SOFTWARE: Patentin Ver . 2.0 
SEQ ID NO 108 
LENGTH: 4 80 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-607-565-108 

Query Match 91.9%; Score 22 80; DB 12; Length 4 80; 

Best Local Similarity 99.8%; Pred. No. 2.9e-198; 

Matches 420; Conservative 0; Mismatches 1; Indels 0; Gaps 0 
Qy 3 7 SQAVCWRSMRRGCAVLGALGLLAGAGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEA 96 

IIIIIIIIIIIIIIMIMIIIIMIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIM 

Db 60 SQAVCWRSMRRGCAVLGALGLLAGAGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEA 119 

Qy 97 SAEEALLPALPKTVSFRINSEDFLLEAQVRDQPRWLLVCHEGWSPALGLQICWSLGHLRL 156 

IIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIMIIIMIIIIIIIIIIIIIII 

Db 12 0 SAEEALLPALPKTVSFRINSEDFLLEAQVRDQPRWLLVCHEGWSPALGLQICWSLGHLRL 17 9 

Qy 157 THHKGVNLTDIKLNSSQEFAQLSPRLGGFLEEAWQPRNNCTSGQWSLRCSECGARPLAS 216 

IIIIIMIIIIMIIMIIIIIIIIIMIIMIIIIMIIIIMIIIMIMMIIIIM 

Db 18 0 THHKGVNLTDIKLNSSQEFAQLSPRLGGFLEEAWQPRNNCTSGQWSLRCSECGARPLAS 23 9 

Qy 217 RIVGGQSVAPGRWPWQASVALGFRHTCGGSVLAPRWWTAAHCMHSFRLARLSSWRVHAG 276 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIMIIIIIIII 
Db 24 0 RIVGGQSVAPGRWPWQASVALGFRHTCGGSVLAPRWWTAAHCMHSFRLARLSSWRVHAG 2 99 

Qy 2 77 LVSHSAVRPHQGALVERIIPHPLYSAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQHF 33 6 

IIIIIMIIIMIIIIIIMMIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIMII 

Db 3 00 LVSHSAVRPHQGALVERIIPHPLYSAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQHF 35 9 

Qy 337 PKGSRCWVSGWGHTHPSHTYSSDMLQDTWPLFSTQLCNSSCVYSGALTPRMLCAGYLDG 396 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 360 PKGSRCWVSGWGHTHPSHTYSSDMLQDTWPLFSTQLCNSSCVYSGALTPRMLCAGYLDG 419 

Qy 3 97 RADACQGDSGGPLVCPDGDTWRLVGWSWGRACAEPNHPGVYAKVAEFLDWIHDTAQDSL 456 

MIIIMIIIIIIIIMIIIIIIIIIIMII llllllllllllllllllllllllllll 

Db 42 0 RADACQGDSGGPLVCPDGDTWRLVGWSWGRGCAEPNHPGVYAKVAEFLDWIHDTAQDSL 479 
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